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ature is discussed. At room temperature, sequence heterogeneity dominates 
the physics of single molecule force-extension curves starting about 7 piconew- 
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fork exhibits anomalous drift and diffusion in a similar range above this tran- 
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sition. Energy barriers near the transition scale as the square root of the 
genome size. Recent observations of jumps and plateaus in the unzipping of 



lambda phage DNA at constant force are consistent with these predictions. 

O 

I. SINGLE MOLECULE BIOPHYSICS EXPERIMENTS 

The past decade has seen a revolution in biophysics, due to exquisitely sensitive exper- 
iments [1] which probe ingredients of the "central dogma" of molecular biology [2] at the 
level of individual molecules. Although exceptions exist, the central dogma states that bio- 
logical information first flows from DNA to messenger RNA via transcription mediated by a 
RNA polymerase. Information is then transferred from mRNA to proteins via a ribosome- 
mediated translation process. 

Among the efforts to probe the basic constituents further: (a) Proteins anchored to a mi- 
croscope slide in their biologically relevant folded state have been teased apart with atomic 



force probes [3] attached using, e.g., biotin-streptavidin linkages; (b) individual DNA's linked 
to magnetic beads have been stretched and twisted by small magnetic field gradients, allow- 
ing studies of both supercoiling and twist-induced denaturation [4]; and (c) the reversible 
unfolding of single RNA molecules (with beads attached to both ends) has been studied 
using laser tweezers with a feedback loop to generate a constant force F [5]. In this later 
experiment, the fluctuating displacements of a ~25 base-pair RNA hairpin evolve from a 
predominantly closed state (with rare opening events) at small force to a predominantly open 
state (with rare closing events) at larger forces. At an intermediate force of F c 14 piconew- 
tons (pN), the hairpin spends approximately equal time in both configurations, suggestive 
of a first-order phase transition between the open and closed states in the "thermodynamic 
limit" of a very large hairpin. 
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FIG. 1. Schematic of DNA unzipping experiments. The sugar-phosphate backbones, with 
complementary base pairing, are represented by the functions ri(n) and r2(n), where n indexes 
base pairs. Experiments can be carried out either at constant extension ri(0) — ^(0) or at constant 
force F. 



DNA force denaturation experiments related to the subject of this review have been 
carried by Bockelmann et al. [6-8] (see Fig. 1). In this pioneering work, the 48,502 base 
pairs of the virus phage lambda with one strand attached to a microscope slide were unzipped 
by a microneedle attached to the other strand under conditions which produced a constant 
rate of unzipping at a fixed temperature. Reasonable agreement with these experiments 
was obtained by direct numerical evaluation of the equilibrium statistical sums associated 
with a given average degree of unzipping. If the rate of unzipping is slow, such experiments 
effectively probe the physics at constant extension. The average force under these conditions 
locks in at about 14-15 piconewtons. The time-dependent fluctuations of this force as the 
unzipping proceeds provide information about the particular sequence of G:C and A:T base 
pairs being torn apart [6-8] . The full phase diagram of DNA in the force-temperature plane 
is shown in Fig. 2 [9-13]. The thermal DNA melting transition at zero force (see Sec. 2) is 
characterized by diverging length scales associated with denaturation bubbles. When F ^ 0, 
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the native duplex DNA denatures into an unzipped state via a first-order phase transition 
(see Sec. 3) . Under conditions of a slow, constant rate of extension [6-8] , the average force 
adjusts so that the system sits on the heavy first-order transition line F C (T). At any stage in 
the unzipping process, the unzipping fork separating the native and unzipped states is like 
the meniscus dividing, say, liquid and gas phases at constant volume at a bulk first-order 
phase transition. 
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FIG. 2. Phase diagram for DNA denaturation in the force-temperature plane. Conven- 
tional thermal denaturation occurs at zero force as T approaches the melting temperature T rn . 
Force-induced denaturation, or unzipping occurs on the path indicated by the horizontal arrow. 
Sequence heterogeneity dominates the physics between the two dashed crossover lines. 
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FIG. 3. Models of thermal denaturation. (a) Ising model, where right-pointing spins represent 
hydrogen-bonded "helix" segments, and left-pointing spins represent denatured "coil" segments, 
(b) More sophisticated polymer model of denaturation whose statistical mechanics is mapped onto 
a problem in three-dimensional quantum mechanics. Nj,(T) is the number of base pairs associated 
with a typical denatured "coil" segment. 

Although running unzipping experiments in a two-phase region associated with the 
line F C (T) is well-suited for possible DNA sequencing applications [6-8], the full force- 
temperature phase diagram of Fig. 2 is interesting for a number of reasons. First, un- 
derstanding DNA denaturation under conditions of constant force (instead of constant ex- 
tension) could provide insights into the complicated process by which DNA replicates during 
bacterial cell division [15]. Second, statistical mechanics in ensembles with intensive vari- 
ables like the force held fixed is often more tractable than when the control parameter is 
a conjugate variable like the extension. Precise analytic predications are indeed possible in 
the vicinity of F C) even when sequence disorder is present [9,14]. Third, experiments which 
use force as a control parameter can probe interesting behavior of the unzipping fork for 
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F < F C (T) (analogous to "wetting" phenomena in bulk phase transitions [16]) and anoma- 
lous fork dynamics for F > F C (T). Because of significantly different bonding strengths of the 
nucleotide pairs A:T (two hydrogen bonds) and G:C (three hydrogen bonds), heterogeneity 
dominates unzipping over a large region of the phase diagram, for \F C < F ^ §F C , inside 
the region bounded by the dashed lines in Fig. 2. 

II. THERMAL DENATURATION OF DNA 

In this Section we discuss the thermal denaturation of DNA at zero force (vertical arrow 
in Fig. 2). This transition plays an important role in the polymerase chain reaction (PCR), 
where it is applied cyclically to amplify minute amounts of DNA. The basic physical ideas 
driving thermal denaturation (worked out in the 1960s) are described in Refs. [17-19]. The 
simplest models invoke the Ising-like description of DNA shown in Fig. 3a, where right-facing 
spins represent bonded nucleotides in a closed "helix" state and left-facing spins the open 
"coil" state. The magnetic field of the equivalent Ising model represents a (temperature- 
dependent) nucleotide bonding free energy and the Ising exchange energy determines an 
"initiation factor" at the boundaries between the helix and coil regions. The statistical 
mechanics of this one-dimensional Ising model can be described entirely in terms of kink 
excitations associated with transitions between domains of up and down spins, which here 
represent helix/coil boundaries. Rapid variations in DNA from a helix-dominated to a coil- 
dominated state are predicted by such models [19], corresponding to the magnetic field 
changing sign in the equivalent Ising system. However, it is well-known that the Id Ising 
model does not have a real phase transition at finite temperature, basically because the 
finite energy cost of kinks at zero external field is overwhelmed by the entropy gained by 
creating them [20]. An improved theory results when the three-dimensional entropy of 
wandering loops in the coil sections (see Fig. 3b) is taken into account. This can be done by 
approximating the loops as ideal random walks [17], or improving this estimate by taking 
self-avoidance of the loops into account [21]. The effect on the statistical mechanics is to 
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produce a long range interaction which binds kinks together and leads to a continuous finite 
temperature phase transition [17-19]. Recent work by Kafri et al. incorporates additional 
self-avoidance between coil sections and the rest of the DNA chain (important for large 
molecules) and argues that the thermal denaturation transition then becomes first order 



Heterogeneity of the bonding between base pairs (G:C bonds are stronger than A:T ones) 
is neglected in the treatments above. Simple helix-coil descriptions with heterogeneity are 
governed by the statistical mechanics of the Id random field Ising model. A reasonable 
guess for more realistic models with heterogeneity is that while the transition may be less 
sharp (weakly bound regions melt before more tightly bound ones), a real finite temperature 
phase transition survives. Lyubchenko et al. [23] have calculated numerically the effect of 
sequence heterogeneity on the DNA differential melting curves for the 5375 nucleotides of 
the virus 0X174. For a recent theory of thermal denaturation of heterogeneous DNA, see 



In the remainder of this section, we discuss how thermal denaturation of "homogeneous" 
DNA with identical base pairs can also be understood by mapping the statistical mechanics 
onto the derealization of a quantum mechanical particle in three dimensions. This is the 
approach we will generalize in Sec. 3 to study force-induced naturation. 

Referring to Fig. 3b, we parameterize the positions of the two (antiparallel) sugar- 
phosphate backbones of DNA in three dimensions by the functions r^ra) and r 2 (n), where 
n is an integer indexing N base pairs (0 < n < N). We model each strand separately as a 
Gaussian random coil [25] . For example, considering strand 1 in isolation, the probability 
of a particular polymer conformation ri(ra) is proportional to e~ Fl ^ n ^ T (we use units such 
that ks = 1), where 



Here we have used a convenient continuum notation as shorthand for a discrete sum over n 
and replaced n by the arclength s = nd, where al is the spacing between nucleotide monomers. 



[22]. 



Ref. [24]. 




(1) 
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To relate the spring constant K to the single strand persistence length, use (1) to evaluate 
(\n(Nd) — ri(0)| 2 >, with the result 

(\n(Nd)-n(0)\ 2 ) = 3 - 1 ^- (2) 
From the definition of persistance length £, namely (\fi(Nd) — ri(0)| 2 ) = INd, we have [26] 

A' = f , (3) 

where £ m 10A for single-stranded DNA at physiological temperatures. For a more micro- 
scopic "worm-like chain" model with bending rigidity k (assuming a small Debye screening 
length), we have [27,28] 

I - f • (4) 

Here, and in what follows, we use boldface vectors r(s) to denote entire polymer configura- 
tions, and conventional vectors such as f(Nd) to denote the position of a polymer endpoint. 

To describe a DNA duplex, we bring the two strands together, neglect torsional rigidity 
and the helical nature of the bonding [28] , and write the total free energy as 



F[r,( S ),r 2 ( S )] = f/™ (^) 2 + (^) 2 



ds 



+ J Nd U[ ri ( S ) - r 2 (s)} ds, (5) 

where U[ri(s) — r 2 (s)] is a potential which binds the nucleotides in different strands to- 
gether. We assume that the complimentarity of the nucleotide sequences is sufficient to 
bring the strands into registry, but will otherwise neglect the effect of sequence disorder in 
the remainder of this section. If we now pass to sum and difference variables. 

R(8) = \[T 1 (8)+T 2 (8)], (6) 



r(s) = r 2 (s) -ri(s), 
the coarse-grained free energy decouples, 
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(7) 



F[R(s),r(s)} =KJ Nd ds (f ) 2 + \g S ™ ds 

+ J Nd ds U[r(s)}, 



(8) 



where g = K/2. 

The partition function associated with (8) is a path integral of e~ F ^ R,r ^ T over the func- 
tions R(s) and r(s). The part associated with R(s) is simply the partition sum of an 
unconstrained Gaussian coil. The remaining functional integral over r(s) contains a denat- 
uration transition. To pursue this point, we assume for simplicity that r(s = 0) = and 
r(s = ATd) = r (closed boundary conditions at one end, open at the other), and evaluate 



^ i-r(Nd)=f 

Z(0,f;N) = / Drexp 

Jr(0)=6 



g fNd (drY , 1 f Nd TT[ . 



ds. 



(9) 



As in polymer adsorption problems, it is helpful to view Eq. (9) as a Feynman path integral 
for a quantum mechanical particle in imaginary time [27]. Indeed, this partition function 
can be rewritten as a quantum mechanical matrix element [29], 



Z(6,f;N) = (f]e~ ANd / T \Q), 



(10) 



where the effective quantum Hamiltonian is 

— T 2 



H 



2g 



V 2 + U(r) 



(11) 



and |0) and (r\ are respectively ket and bra vectors localized at and f. Note that tem- 
perature plays the role of H and g = \K = |? represents the mass of a fictitious quantum 
mechanical particle in a potential U(r). In the language of statistical mechanics, we have re- 
duced this one-dimensional problem to diagonalizing a transfer matrix given by T = e~ Hd l T . 
We shall focus on the particularly simple binding potential illustrated in Fig. 4, namely 



U(r) 



oo, r < c 
-U , c <r <b 
0, b < r, 
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with Uo = Vo/d, where Vq is the average bonding energy per nucleotide. Upon inserting a 
complete set of energy eigenfunctions \n) = <f> n (f) with eigenvalues e n into Eq. (10), we see 
that this conditional partition function may be rewritten as 



Z(6,f;N) = $»-^/ r |n)(n|0> = £ <P* n (6)Ur)e— Nd/T . 



(12) 



localized, F < F 




(a) 



At/(r) 



delocalized, F>F c 




FIG. 4. Potential well and ground state eigenfunctions for forced-induced denaturation when 
(a) F < F c and (b) F > F c . These (right) eigenfunctions give the probability of a separation r for 
the unzipped ends of the DNA. For F < F c , the ground state eigenfunction decays exponentially 
to zero outside the well, which models the hydrogen bonds and stacking energies which hold base 
pairs together. For F > F c , the eigenfunction tends to a nonzero constant for large r, and the two 
DNA strands fall apart. 
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The ground state dominates in the limit N — > oo, and we have 



Z(O,f;iV)^0 o (O)0o(r)e"^/ T , 



(13) 



where the (nodeless) ground state eigenfunction can be chosen to be real. 

The quantum mechanics of simple square well potentials like Eq. (12) in three dimensions 
is well understood [30]. We focus for simplicity on the case c « b (small hard core diameter) 
but the same qualitative discussion applies more generally. The key parameter controlling 
the eigenfunctions and eigenvalues is the dimensionless ratio 



of the well depth U to the "zero point energy" ■ This quantum zero point energy 
represents the lost entropy associated with a confined DNA duplex. From the perspective 
of quantum mechanics, the loop excitations shown in Fig. 3b represent transient excursions 
(in imaginary time) out of the well. When Q » 1 (low temperatures), the binding energy 
dominates the thermal fluctuations favoring denaturation and there are many bound (i.e., 
localized) eigenstates of H beneath a continuum of extended eigenstates. As Q decreases, 
more and more states delocalize until only the single localized ground state shown in Fig. 
(4a) is present. Eventually, at temperatures high enough so that Q < Q*, even the ground 
state delocalizes [31]. For the simple problem discussed here Q* = ^ (i.e., T m = 8U b 2 /n 2 g) 
[30] . However, a similar Q* of order unity is expected for a broad class of similar potentials. 
Near the transition temperature T m , the ground state energy approaches the continuum 
quadratically with temperature [27, 30] 



where the melting temperature T m is defined by Q{T m ) = Q*. For large r, the ground state 



Q = 



2gU b 2 

J^2 



(14) 



eo CO ~ —const. I T m — 




(15) 



wave function decays exponentially, <fio(r) 



e K(, l^, where 



^(O 



— T 2 1 



(16) 



^ 2^0(0 (Tm-T) 
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gives the spatial extent (in three dimensions) of the bubbles shown schematically in Fig. 3b. 
The number of base pairs N b in a bubble (see Fig. 3b) can be estimated by identifying N b 
with the value of N needed to achieve ground state dominance in Eq. (12): We expect that 
boundary effects become unimportant when the number of base pairs exceeds the number 
in a typical bubble. The ground state will dominate once the number of nucleotides is larger 
than several bubble sizes. Since the energy of the first excited state for Q > Q* is zero, we 
have 



Z(6,f;N) ^ o (O)0 o (r)e^ 



-\en\Nd 



(17) 



from which it follows that 



T 1 

N b « ~ -= (18) 

d\e {T)\ (T m — T) 2 

We should stress that the above results are obtained in an approximation which neglects 
self-avoidance effects important for large molecules close to T m . We expect, however, quali- 
tatively similar behavior even when self-avoidance is taken into account. We now use similar 
methods to treat force-induced denaturation in Section 3. 



III. FORCE-INDUCED DENATURATION 



A force F applied across the two strand endpoints (see Fig. 1) adds an energy — F ■ r, 
and Eq. (9) becomes 



Z(0,r;N)=J^Dr(s)e W 



-%fo Nd (£) 2 ds-±j Nd UMs)]ds 



(19) 



We assume the attached strands of DNA can swivel freely to relax twist on an experimental 
time scale, so that we can neglect the helical nature of the duplex state when the force is 
applied. In addition to the force term (note that the lower limit does not contribute with 
our boundary conditions), we have added a subscript s to the binding potential, C/[r(s)] — > 
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U s [r(s)], to emphasize that the depth and size of potentials like that in Fig. 4 will depend 
on the particular base pair for heterogeneous DNA. After taking over results for a virtually 
identical problem [32] of flux line depinning by a columnar pin in a transverse magnetic field 
in high T c superconductors, we obtain [9,10] 

T dz{ dNd N) = = £r( f -j) r1 N) + Un ^ z & f; ^ 2 °) 

which can be integrated (subject to the boundary condition Z(6,f; 0) = 1) to give a result 
with the form of Eq. (10), where 

. - — T 2 fV 

Equation (20) with an s-dependent potential U s [r\ can be studied (via a mapping onto 
a Burgers equation) by the methods of Ref . [33] , where it arose in a study of vortex depin- 
ning from fragmented columnar pins. In this section, we neglect the s-dependence (letting 
UNdif) —> U{f)) and illustrate force-induced denaturation for the homopolymer duplex 

— * 

discussed in Sec. 2. Because F appears as a constant imaginary "vector potential," Eq. 
(21) represents a non-Hermitian generalization of Schroedinger's equation. To establish the 
existence of a first-order transition, we write the solutions of (20) using the right and left 
eigenfunctions {^^(r)} and {0^(r)} (with a set of common eigenvalues {E n (F)}) of H{F), 

Z(0, f; N) = £ €L$)€ni?)e- E ^ Nd ' T . (22) 

n 

We again invoke ground-state dominance and focus on the behavior of the lowest eigenvalue, 
which determines the duplex free energy per unit length a(F) via 

a(F) = -T^ o ± i Hmn, (23) 
where Z(N, F) is defined by integrating over r, 

Z(N,F) = J d 3 rZ(6,f;N), (24) 

since we work at constant force instead of constant extension. 
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For F less than a critical value F c (determined below), we can obtain the (right) eigen- 
functions <?^ R (r) for F ^ from localized eigenfunctions <j) n {r) for the Hermitian case F = 
via a "gauge transformation" [32]. Indeed, it is easy to check that 

€r{?) = e^Mrl (25) 

solves 

H{F)€r{?) = E n 4> F M (26) 

(provided (f> n (f) satisfies H(0)<f> n (f) = e n <f> n (r)) with the same eigenvalue, E n = e n . In 
particular, we obtain from Eq. (23) a negative F-independent free energy per unit length 

— * 

a = e = —\e \ for small F. If, however, a localized ground-state eigenfunction for the 
Hermitian F = problem decays like (po(r) ~ e" K0 '^ for large r, the new eigenfunctions 
(25) are only normalizable provided F < F C (T) = Tn (T). Note that <f>Q R (r), which gives 
the probability of finding an extension r, [32] is displaced in the direction of F, as indicated 
schematically in Fig. 4a. To determine what happens for F > F c , we check for a nodeless 
extended ground-state wavefunction, as indicated in Fig. 4b. If (pQ R (r) — > const. > for 
large r (i.e., the DNA duplex falls apart), and lim^oo U(r) = 0, evaluating Eq. (22) in this 
limit gives H(F)0q R (F) = -^-0qr(?), so that a(F) = E (F) = The two free energies 

a(F) for F < F c and F > F c are plotted in Fig. 5 [32]. At F c the two curves intersect at a 
nonzero angle, the classic signature of a first-order phase transition. The latent heat of this 
transition might be observable in a solution of many DNA duplexes pulled apart by, say, 
beads of opposite charge in an electric field. Since the first-order transition curve F C (T) is 
given by —F^/2g = — |e (T% where e is the smallest eigenvalue of the transfer matrix for 
F = 0, we find using Eq. (15) that 

F C (T) oc const. | T m -T\ (27) 

near the melting transition for this model. 
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FIG. 5. The constant free energy a = — |eo| P er base of duplex DNA compared to the parabolic 
free energy a(F) = —F 2 /2g appropriate for two separated strands. See the text for a discussion 
of more sophisticated freely-jointed chain and worm-like chain models of this free energy. The 
two free energies cross with discontinuous slope at F C (T), the location of the first-order unzipping 
transition. 

Even though force-induced denaturation is a first-order transition, there are diverging 
precursors in the form of DNA unzipping (see Fig. 1). The probability of an endpoint 
separation |f| in the limit A" — > oo is given integrating \r\ weighted by the ground state 
eigenfunction, 



4>or =eT<j> (i f ) 



g T g K 1 r I 



It is then straightforward to show that (\f\) diverges as F — > F c (T) [32] 



(28) 



(29) 



F C (T)-F' 

Upon examining corrections to ground state dominance as in the analysis of Eq. (19), we 
find that the average number m of unzipped monomers diverges, as shown in Fig. 6a, 



(m) 



d(|eo|-F2/2g) 

1 

^ F C (T)-F- 



(30) 
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(b) 



F 



FIG. 6. Thermal average (m) of the location of the unzipping fork, (a) Divergence of (m) at F c 
for a homopolymer. (b) Jumps and plateaus in (m) (with, on average, a much strong divergence) 
in the presence of sequence disorder. 

The first-order force-induced DNA denaturation transition can be understood more gen- 
erally, independent of the mapping onto quantum mechanics. If ai(T,F) is the free energy 
per unit length of one of the single-stranded DNA "handles" shown in Fig. 7, and a (T) 
is the (force- independent) free energy per unit length of the double-stranded DNA which 
coexists in a macroscopically unzipped state, then the condition for two-phase coexistence 
across the "meniscus" or unzipping fork at the critical force F c is [14] 
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ao(T) = 2ai(F c ,T). 



(31) 



The physics of the handle free energy ai(F,T) is relatively simple, provided the force is 
large enough to neglect self-hybridization and self-avoidance of the single-stranded DNA. 
For example, we could take ai(T,F) to be the free energy of a freely-jointed chain with 
persistance length I [25] 



which is proportional to —£F 2 /T for small F. Alternatively one could use a more sophis- 
ticated "worm-line chain" approximation valid over a larger range of forces [34]. With an 
acceptable approximation to a\(T,F) in hand (if necessary, this function could be obtained 
directly from integrating single strand force-extension curves [35]), one can use the experi- 
mentally determined phase boundary F C (T) for unzipping in conjunction with Eq. (31) to 
explore the temperature dependence of the free energy a (T) of double-stranded DNA. 



FIG. 7. Two-phase coexistence of duplex DNA with free energy oer base pair ao and two 
unzipped "handles," each with free energy per base a±. 




(32) 



ma, 



ma 
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IV. UNZIPPING WITH SEQUENCE HETEROGENEITY 



The effect of sequence heterogeneity on thermal denaturation of DNA at zero force is a 
subtle and still not completely resolved problem, [19,23,24]. Understanding inhomogeneous 
base pairing energies near the first-order force-induced unzipping transition appears to be 
more tractable. The integrated effect of sequence randomness on the energy landscape for 
long strands of DNA does not change the underlying first-order transition, because it only 
produces corrections of order ^= to the free energies per unit length discussed above, where 
iV is the total number of base pairs. However, sequence randomness has drastic consequences 
for quantities like (m) which characterize the unzipping process itself [9,14]. Here, we simply 
sketch the overall picture. Detailed calculations and justifications for approximations (such 
as the neglect of bubbles in DNA) can be found in Ref . [14] . 

Consider the energy e(m) associated with the conditional partition sum of a DNA du- 
plex which has been unzipped by m base pairs. If the DNA is a homopolymer, consisting 
exclusively of A:T or G:C base pairs, the total energy can be read off from Fig. 7, 

£{m) = 2ma\ + (N — m)ao 

= a N + fm (33) 

where 

f = 2ai (F)-a (34) 

can be expanded to give / ~ c^j^\ Fc {F c — F) near the unzipping transition. If we now add 
sequence randomness, a reasonable model for the energy of unzipping is 

A£(m) = £{m) - Na 

— fm+ / dm'r}(m') : (35) 
Jo 

where deviations from the average A:T or G:C content are represented by the function 77(77?/) 
in a convenient continuum notation. To the extent that most DNA sequences which code 

18 



for proteins have deviations from the average G:C/A:T content describable as a random 
walk [36], we expect that the statistical properties of 77(771) at large distances are those of a 
white-noise random variable, 



7](m)i](m') = A 5(m — m) (36) 

where the overbar represents a "disorder average" over all possible sequences of base pairs. 
Because G:C and A:T pairing energies differ by an amount of order T, A ~ T 2 , where we 
have used the temperature T ~ 293-3 10°K characteristic of most experiments to characterize 
this quantity. The partition function associated with this simple one-dimensional model 
involves integrating over all possible unzipping lengths m, 

POO 

Z(f) = / dme- A£ ^ T . (37) 
J 

The unzipping energy in Eq. (35) is plotted for a particular base pair sequence (with 
A = T 2 ) in Fig. 8, with f/T = 0.01. The straight line is the energy of a homopolymer 
duplex with A = and 

A£(m) = fm. (38) 

In this case we find immediately from Eq. (37) that the thermally averaged degree of 
unzipping is 

^ 7 ~ l^ry m 

in agreement with Eq. (30). There are large fluctuations about this average value, as one 
can check by evaluating 



y/((m-(m))*)=T/f=(m). (40) 
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FIG. 8. Energy landscape for F < F c for a homopolymer duplex (straight line) and with the 
addition of inhomogeneous base pairing energies due to a particular DNA sequence (jagged line). 
Here we take A = T 2 and f/t = 0.01. 



The rugged landscape in Fig. 8 for A ^ is more interesting. Although the positive 
background slope determined by / insures that the DNA remains macroscopically unzipped, 
the integrated effect of sequence randomness produces deviations from a straight line which 
scale like y/m. These fluctuations lead to deep minima at nonzero m with large energy 
barriers in between. A downward fluctuation in this "integrated random walk" energy 
landscape at position m corresponds to an energy of order 



Ae(m) ~ fm — \jA m. (41) 



Upon minimizing over m, we obtain the estimate 



Aq 1_ 

P ~ (F c - Ff 



<™> ~ ^ ~ t tp w (42) 



a result confirmed by more elaborate calculations [9,14]. Here, the overbar represents an 
average over a quenched random distribution of DNA sequences. It can also be shown that 
thermal fluctuations about this average are more constrained than for homopolymers, in the 
sense that 

y/((m- <™» 2 > ~ ~ W- (43) 
Thus, in contrast to homopolymers, where ((m — (m)) 2 )/ (m) 2 = 0(1) we now have 
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((to - (m)) 2 )/((mf = 0(1/ (mf 2 ). 



(44) 



A typical energy barrier associated with disorder-dominated unzipping is /(to) + y A (to) ~ 
y 1 . Results such as (42) apply when this scale exceeds T, i.e., for / less than a crossover 
force f x , 



Because A = 0{T 2 ) is large for G:C and A:T base pairs, the range of forces where sequence 
heterogeneity dominates the physics is the large region bounded by the left-hand dashed 
line in Fig. 2. Note that disordered dominated prediction for (to) in Eq. (42) diverges with 
a power which is twice as large as the result (39) appropriate for thermally equilibrated 
homopolymers. 

Results such as Eq. (42) apply only to a quenched average over an entire library of 
different DNA sequences unzipped in parallel. We can understand the behavior for F < 
F C (T) for the more experimentally accessible case of a particular squence from Fig. 9, which 
shows the energy landscape for two identical sequences with the biases, f /T = 0.01 and 
f/T = = 0.006. The degree of unzipping (to) is dominated by the minimum indicated by 
an arrow in each case. Somewhere between / = 0.01 and / = 0.006, the average of (to) jumps 
from one minimum to the next. More detailed calculations [14] reveal an entire sequence of 
jumps and plateaus, as illustrated in Fig. 6b. Although a best fit to a power law of this 
irregular curve should reveal the exponent of Eq. (42), the plateaus and jumps themselves 
represent a rough "fingerprint" of the individual sequence. See Ref. [14], where the statistical 
distribution of plateaus and jumps are evaluated using the techniques of LeDoussal et al. 



f ^ fx rr ' 



(45) 



[37]. 
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(b) 

FIG. 9. Energy landscapes two identical DNA sequences with Ao = T 2 and (a) f/T = 0.01 
and (b) f/T = 0.006. The arrow indicates the energy minimum which dominates the value of (m) 
in each case. 

V. DYNAMICS OF UNZIPPING 

What happens for F > F C (T)? In Figure 10 we show an energy landscape (for the same 
sequence as in Fig. 8) with f/T = —0.01, together with the purely downhill landscape 
of the corresponding homopolymer. Because the average free energy per base pair of the 
DNA duplex exceeds the combined free energies per base of the two single-stranded DNA 
"handles" in Fig. 7, the system is unstable to complete unzipping. It is then appropriate to 
discuss the dynamical process by which unzipping proceeds on this downhill path. 
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c 

FIG. 10. Downhill energy landscape appropriate for are F > F C (T) Although the overall slope 
is negative, there are many traps, such as those at positions A, B, and C. Energy barriers between 
these traps scale as \fK$m, where m is the number of bases between traps. 

Subject to a number of simplifying conditions discussed in Ref. [14], the time dependence 
of the unzipping fork can be described by an overdamped Langevin equation, 



where T is a damping constant (T = 1/ (tT), where r is a microscopic relaxation time) and 
the noise correlations are 



Upon substituting for the energy landscape £(m) from Eq. (35), we consider the case / < 
and obtain 



which is the equation of motion for a particle with coordinate m(t) executing one-dimensional 
biased diffusion in a random force field proportional to r](m). 

A great deal is known about this problem [38-40]. In the absence of sequence hetero- 
geneity (A = in Eq. (36)), m(t) exhibits diffusion with drift at long times, i.e., 




(46) 




(47) 




(48) 




t 



(49) 



lim 



<[m(t) - (m(0))] 2 ) 



= 2D 



(50) 



t 



23 



with a well-defined drift velocity v = T\f\ and diffusion constant D = TT. This is the 
expected behavior for the straight line homopolymer energy landscape shown in Fig. 10. 
Sequence heterogeneity, however, has a dramatic effect on the dynamics unless |/| is large. 
Indeed, the integrated random walk landscape in Fig. 10 has many deep minima, even 
though the average slope is negative. When |/| is small, typical energy barriers to travel 
a distance m scale like y/Aom, as is reflected in the small barrier connecting the minima 
labelled A and B and the much larger barrier between these minima and the more dis- 
tant minimum C. A detailed analysis [38-40] reveals three types of anomalous dynamics, 
depending on the dimensionless parameter 



FIG. 11. Experimental setup for the force DNA unzipping experiments of Danilowicz et al. 
[42]. The unzipping trajectories of dozens of genetically identical lambda phage DNAs attached to 
a thin capillary tube are tracked. A constant force is generated by attaching magnetic beads and 
placing the apparatus in a magnetic field gradient. 

Right at the unzipping transition, \i = \f\ = and the unzipping fork wanders sub- 
diffusively according to [39] 



2T|/| 



(51) 



A 



A 





(52) 
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because of trapping effects arising from the large energy barriers. The conventional diffusion 
constant D, as defined by Eq. (50), vanishes. When < /i < 1, the unzipping fork drifts 
downhill, but does so sublinearly with time, 

(m(t)) ~ const. t M , (53) 

so the usual drift velocity v defined by (49) vanishes, with an additional anomaly in the 
spread about this average drift [39,40]. When this bias is large enough so that 1 < pL < 2, 
one recovers a well-defined drift velocity v, but with super diffusive spreading, 

([m(t)-(m(tW) r (54) 

Conventional diffusion with drift is only recovered for forces large enough so that ji > 2, i.e., 
for 

~f>f x = ^- (55) 

This critical force is the same order of magnitude as the crossover scale defined below the 
unzipping transition by Eq. (45) and leads to the right-hand dashed line in Fig. 2. The pa- 
rameters of DNA are such that anomalous unzipping appears in a very large range (compared 
to typical disorder effects in conventional critical phenoman [41]), roughly \F C < F < |F C . 
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FIG. 12. Unzipping histories obtained from the apparatus of Fig. 11 for (a) three beads with 
F =15 pN and (b) four beads with F =20 pN. Note the long-time scale (of order 1 hour) and 
the many jumps and plateaus. The oscillations arise when the unzipping fork encounters two 
nearly degenerate energy minima. The same plateaus recur for different beads at the same force, 
suggesting that these are due to sequence disorder (from Ref. [42]). 
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VI. PAUSES, JUMPS AND CONCLUDING REMARKS 



Experiments which study the mechanical denaturation of lambda phage DNA at constant 
force have recently been carried out in the group of Mara Prentiss [42] . Digoxigenin linkers 
attach several dozen identical copies of this DNA along a long thin capillary tube. Each 
DNA to be unzipped is offset by an additional DNA spacer (see Fig. 11). Biotin links 
the other end of the DNA to magnetic beads. Unzipping at constant force is achieved by 
applying a uniform magnetic field gradient, causing the beads to stretch out at right angles 
to the capillary tube, like a series of flags on a flagpole. 

Figure 12a records some unzipping histories at T = 20° C for three different beads at 
different heights on this "flagpole" with F — 15 pN, corresponding approximately to the 
critical unzipping force for the front half of the sequence for phage lambda. (The back half of 
this DNA has a lower G:C content.) The beads are well separated, and are clearly subject to 
microscopically different thermal histories. Nevertheless, there are distinctive pauses (some 
lasting 15 minutes or more) and jumps common to the different beads. As the individual 
DNA's unzip, the beads occasionally jump back and forth, suggesting two nearly degenerate 
minima in the energy landscape about 1000 base pairs apart. This behavior is similar to 
that found for much shorter RNA hairpains in Ref. [5]. The long pauses before unzipping 
resumes are consistent with large energy barriers, of order 20-30 T. In contrast, unzipping 
at a slow constant velocity is an imperative in the work of Bockelmann et al. [6-8]. Because 
these experiments are effectively at constant extension, the force in this case fluctuates as 
needed about F C {T) to insure that all energy barriers are overcome. 

Many of the pauses and local "two-level system" oscillations found by the Prentiss group 
coincide with those predicted for phage lambda DNA [42], using energy landscapes obtained 
from the base pairing and stacking energies of Santa Lucia et al. [43] . Overall, these exper- 
iments seem consistent with important aspects of the theory reviewed here, such as large 
energy barriers and sequence-specific pause points. We should stress that longer run times 
or higher temperatures are necessary to obtain true thermodynamic equilibrium. Figure 
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12b shows the unzipping history of four different beads at F ~ 20 pN . There are again 
jumps between common plateaus as well as oscillations suggesting nearly degenerate min- 
ima. Although 20 pN should be above the thermodynamic critical force F C (T), the DNA 
nevertheless fails to unzip completely on experimental time scales, presumably due to the 
ruggedness of "downhill" landscapes such as that in Fig. 10. Predictions such as Eq. (42) 
are only be applicable to systems in true thermodynamic equilibrium. 

Experimental checks of phase diagrams such as that in Fig. 2 and the associated physical 
predictions would be of considerable interest. As mentioned in Sec. 3, the phase diagram 
itself can be used to directly measure the temperature-dependent free energy of the DNA 
duplex at zero force. Analysis of this free energy is simplified if a theory of the unzipped 
handles is available. Analytic theories require forces large enough to stretch out the handles 
and prevent hairpins created by self-hybridization. The self-hybridization of ssDNA (for 
F < 10 pN) indicated in the experiments of Maier et al. [44] could be reduced by introducing 
single stranded binding proteins to smooth out the heterogeneity of the handles. 

The assumption (36) of short-range correlations in the sequence is an approximation 
valid only for simple "coding DNA" in bacteria or viruses. The introns or so-called "junk 
DNA" present in many eukaryotic organisms may be better described by [36] 

r)(m)r)(m') ~ 1 ,, 2 _ 20 , (56) 

with p > 1/2. Although we are unaware of detailed calculations, a straightforward general- 
ization of the argument leading to Eq. (42) now gives [9] 

For (3 < 1/2, we expect that predictions for sequences with short-range correlations will 
apply. The exponent f3 = 0.55 seems to describe the energy correlations of some intron-rich 
sequences [36]. 

Ideas similar to those discussed here can also be applied to force-induced denaturation 
[45,14] of the multiple stem- loop structures which define the secondary structure of compli- 
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cated RNA enzymes. Here, however, strings of RNA hairpins are unzipped simultaneously 
and the discrete jumps and plateaus in (m) near the unzipping force are averaged out. 
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